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Abstract 


Confidence  intervals  for  median  survival  times  are  derived  for  censored 
survival  data.  The  intervals  are  obtained  by  using  the  quantiles  of  the 
Kaplan-Meier  product  limit  estimator  and  have  the  same  Pitman  efficiency  as 
the  intervals  found  by  inverting  the  sign  test.  Two-sample  tests  and 
confidence  intervals  for  the  difference  in  median  survival  times  are  then 
developed  based  on  the  comparison  of  the  one-sample  confidence  intervals. 
Several  methods  for  choosing  the  confidence  coefficients  of  the  corresponding 
one-sample  confidence  intervals  are  developed.  The  Pitman  efficiencies  of 
these  two-sample  tests  are  the  same  as  that  of  the  median  test  proposed  by 
Brookmeyer  and  Crowley  (1982b).  The  procedures  can  also  be  used  for  the 
Behrens-Fisher  problem. 

Nonparametric  two-sample  inference  procedures  are  useful  in  comparing  the 
responses  of  treatment  and  control  groups.  In  medical  follow  up  studies  the 
data  are  usually  subjected  to  censoring.  While  many  of  the  two-sample 
procedures  have  been  extended  to  accommodate  censored  data  we  show,  in  this 
paper,  how  to  construct  two-sample  tests  and  confidence  intervals  based  on 
one-sample  confidence  intervals.  This  method  was  first  discussed  in 
He’: tmansperger  (1989a)  for  uncensored  data  where  a  confidence  interval  for  the 
difference  in  population  medians  is  constructed  by  subtracting  the  endpoints 
of  one  (one-sample)  interval  from  the  opposite  endpoints  of  the  other.  The 
test  then  rejects  the  null  hypothesis  of  equal  medians  if  the  one-sample 
intervals  are  disjoint.  Hettmansperger  used  the  s ign- interval  as  the  one- 
sample  interval  which  is  obtained  by  inverting  the  sign-test.  In  the  presence 
of  censoring  we  modify  the  sigri-ir.torval  to  the  so  called  quanti j.e- interval 
whose  endpoints  are  the  quantiles  of  the  product-limit  estimator  of  Kaplan  and 
Meier  (1958).  After  deriving  the  asymptotic  properties  of  the  quantile- 
interval  we  show,  for  specified  overall  level  a,  three  ways  to  select  the 


confidence  coefficients  for  the  one-sample  quantile  intervals.  For 


a  =  0.05  and  equal  confidence  coefficients  for  both  samples,  the  one-sample 
confidence  coefficients  are  in  the  neighborhood  of  .85  under  the  proportional 
hazard  model.  The  procedures  are  then  applied  to  data  from  a  colorectal 
cancer  clinical  trial  to  compare  four  treatments. 


X-.: 


Key  Words:  Product-limit  estimator,  median  test,  sign  test,  confidence 
interval,  quantile,  asymptotic  relative  efficiency. 


Section  i.  Introduction 

Nonparametric  two-sample  tests  based  on  ranks,  e.g.  median  and  Wilcoxon 
tests,  have  been  extended  by  Gehan  "( 1 965 ) ',  Efron  (1967),  and  Brookmeyer  and 
Crowley '( 1 982b)  among  others  to  accommodate  censored  survival  data.  While 
there  is  a  vast  literature  on  general  linear  rank  test  procedures  with 
censored  observations,  two-sample  confidence  intervals  have  received  much  les 
attention,  Nonparametric  confidence  intervals  for  the  ratio  of  two  scale 
parameters  were  derived  for  randomly  right  censored  data  by  Wei  and  Gail 
(1933)'  based  on  the  idea  of  Hodges  and  Lehmann  96r3') .  Their  method  can  also 
be  applied  to  obtain  confidence  intervals  for  the  difference  of  two  location 
parameters.  The  confidence  sets  were  computed  by  a  numerical  method  based  on 
a  grid  search  and  may  not  yield  an  interval.  In  this  paper^we  shall -focus  on 
the  problem  of  testing  and  confidence  intervals  for  the  difference  of  two 
population  medians  with  randomly  right  censored  data.  Inferences  for  other 
two-sample  location  and  scale  parameters  will  be  discussed  briefly  in  section 


Following  the  idea  of  Hettmansperger  (1984a),  we  shall  show  how  to 

construct  a  two-sample  test  and  confidence  interval  based  on  one-sample 

confidence  intervals  for  the  median  of  each  population.  Let  [L^,  U^]  and 

IL  ,  U  ]  denote  one-sample  confidence  intervals  for  the  population  medians  0 
y  y  X 

and  9  with  confidence  coefficients  Y  and  Y  ,  respectively.  A  confidence 
y  X  y  ^ 

interval  for  A  =  9  -  9  ,  the  difference  of  the  population  medians,  can  be 

y  ^ 

obtained  as  [L  -  U  ,  U  -  L  ]  with  coefficient  1  -  a.  The  overall 
y  X  y  X 

coefficient  1  -  a  obviously  depends  on  the  respective  one-sample 
coefficients  Y^  and  Y^.  Bonferoni's  inequality  can  be  applied  to  obtain  a 
bound  on  the  desired  overall  level  but  it  is  usually  too  conservative.  Using 
cno-""smpIe  si j^n- intervals  and  their  asymptotic  distributions,  Hettmansperger 

n  )  Qho,vcd  riuw  t.o  seLHcr.  f  ann  ^  for  a  opcjCLfied  a.  For  exa;iiuie;  if 

X  y  > 
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.05. 


one  chooses  equal  coefficients  then  Y  =  Y  is  about  .84  for  a  = 

X  y 

Further,  the  size  a  two-sample  test  of  A  =  0  vs.  H^:  A  5*  0  rejects  Hq 

when  the  one-sample  intervals  are  disjoint. 

In  the  presence  of  censoring,  the  sign-intervals  need  to  be  modified.  To 

motivate  the  construction  of  a  sign-interval  with  censored  data  let  us  first 

consider  the  situation  when  there  is  no  censoring.  Let  X,,.  ^  ...  ^  X,  ,  be 

C 1;  Cm ; 

an  ordered  sample  from  a  continuous  distribution  with  unique  median  9^.  The 
sign-interval  is  LL^,  U^]  =  [X,..,  X,  .  with  confidence  coefficient 
Y^  =  1-2  p(S  <  d),  where  S  has  a  binomial  distribution  with  parameters 
m  and  .5  and  d  is  called  the  depth  of  the  sign-interval.  Given  any 
distribution  function  F(t),  define  its  left  continuous  inverse  to  be 


If. 


ii 


i 


ti; 


F  (p)  =  inf{t:  F(t)  ^  p}  for  O^p^l 


(1.1) 


One  then  notices  that  X.^,  =  F  '(d/m)  and  X,  .  =  F  ' ( (m-d+1 )/m) ,  where 

Cd)  m  (ra-d+1 )  m 


F^( • )  is  the  empirical  distribution  function.  In  the  presence  of  censoring. 


the  nonparametric  counterpart  of  F  (•)  is  the  product-limit  estimator 

m 


(PLE)  F^(*)  proposed  by  Kaplan  and  Meier  (1958).  It  is  now  clear  that,  for  a 


specified  depth  d,  one  can  use 


:l  ,  U  ]  =  [F“'(d/m),  F’'((m-d^-1  )/m)] 
X  X  m  m 


(1.2) 


as  a  confidence  interval  for  the  median.  The  confidence  coefficient  Y  will 

X 


no  longer  be  available  from  a  binomial  table  and  will  have  to  be  approximated 
from  a  normal  table  due  to  the  censoring  scheme.  It  should  be  mentioned  here 
that  the  confidence  interval  (1.2)  is  different  from  the  confidence  region  R 

a 

based  on  the  sign  test  in  Brookmeyer  and  Crowley  (1982a).  For  this  i^eason  and 
2’  is  obtained  from  estimates  of  the  quantiles  of  the  true 


■  i  nee 
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distribution  function,  hereafter  we  shall  refer  to  (1.2)  as  the  quantile- 
interval  with  depth  d.  The  confidence  region  need  not  be  an  interval  so 
they  suggested  the  use  of  only  the  interval  part  of  the  region  and  denoted  it 
by  This  truncation  of  the  confidence  region  to  an  interval  complicates 

the  issue  of  establishing  two-sample  inference  procedures  based  on  one-sample 
procedures.  We  therefore  consider  the  quantile- intervals ,  and  we  examine  the 
effect  of  censoring  by  comparing  our  results  to  the  uncensored  case  studied  by 
Hettmansperger  (198i1a).  One-sample  confidence  intervals  for  the  median  which 
are  based  on  the  bootstrap  method  were  derived  by  Reid  (198l). 

Let  [Ly,  Uy]  be  the  quantile-interval  constructed  as  in  (1.2)  for  an 

independent  y  sample  with  confidence  coefficient  The  selection 

of  and  y^  for  a  specified  a  is  given  in  section  3  after  deriving  the 

asymptotic  distributions  of  the  relevant  statistics.  Three  different  ways  to 

specify  and  Y^  are  given.  Under  the  proportional  hazards  model  some  values 

of  Y  and  Y  are  tabulated  in  section  4.  In  particular,  for  a  =  0.05  and  equal 
X  y 

coefficients  (Y  =  Y  ),  the  coefficients  Y  and  Y  are  in  the  range  of  .83  to 
.88  even  for  heavy  censoring  and  for  many  pairs  of  sample  sizes.  Our 
asymptotic  results  are  facilitated  by  i.i.d.  representations  of  the  product- 
limit  estimator  and  its  quantile  process  due  to  Lo  and  Singh  (1986).  Their 
results  are  summarized  in  section  2.  In  section  5  we  apply  the  procedures  to 
data  from  a  phase  III  colorectal  cancer  clinical  trial. 

It  was  shown  in  Hettmansperger  (1984a)  that  the  tests  based  on  sign- 
intervals  have  the  same  efficiency  as  Moods'  (1950)  median  test.  In  the 
presence  of  censoring.  Moods'  median  test  has  been  extended  in  several  ways  by 
Prentice  (1973)  and  Brookmeyer  and  Crowley  (1982).  Gastwirth  and  Wang  (1987) 
extend  the  control  percentile  test  which  is  also  a  median  type  test  when  the 
specified  percentile  is  the  median.  All  these  tests  turn  out  to  have  the  same 


Pitman  efficiency  as  our  test 
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Section  2.  Preliiiiinapies 


Let  us  first  introduce  the  two-sample  location  problem  with  randomly 
right  censored  data.  Let  X°,  . X°  and  Y°, 

independent  random  samples  from  continuous  life  distribution  functions 

F°(t)  =  L(t  -  9  )  and  G°(t)  =  L(t  -  9  ),  respectively,  where  L(*)  is  a 
X  y 

continuous  distribution  function  with  unique  median  zero.  Hence, 

G°(t)  =  F°(t  -  A),  where  A  =  9^  -  9^.  Let  ,  C2 . and  ,  D2,  ...,  0^^ 

denote  two  independent  random  samples  from  arbitrary  censoring  distribution 

functions  H(*)  and  K(*),  respectively.  In  the  random  right  censoring  model, 

one  observes  {(X^^,  e^^),  i  =  1,  ...,  m}  and  6^ ) ,  j  =  1,  ...,  n},  where 

X.  =  min  (X°,  C.)  =  X°  A  C.,  e.  =  I(X.  =  X°) ,  Y.  =  min(Y°,  D.)  =  Y°  A  D., 

1  I’l  1  I’l  1  ij  JJ  J  J 

=  I(Yj  =  Y°)  and  !(•)  is  the  indicator  function.  The  censoring  times  will 
be  assumed  to  be  independent  of  the  corresponding  lifetimes,  and  hence  the 
survival  functions  of  and  Yj  are  F(t)  =  F'^(t)  H(t)  and  G(t)  =  G°(t)  K(t) 
respectively,  where  for  any  distribution  function  W(0,  its  survival  function 
is  W(t)  =  1  -  W(t). 

Denote  the  Kapian-Meier  (1958)  product-limit  estimators  (PLE)  of  F°  and 

G'^  by  F°  and  G°  .  With  proper  choice  of  d^  and  d„  to  be  discussed  later,  the 
m  n  ^  X  y 

quant i le- intervals  for  9  and  9  with  confidence  levels  Y  and  Y  respectively 

X  y  X  y 


[L  ,  U  ]  =  [F°  \d  /m),  F°  "^((m  -  d  +  l)/m)] 
X  X  m  X  m  X 


(2.  1  ) 


[L  ,  U  J  =  [G°  \d  /n),  G°  "’((n  -  d  ^  l)/n)]  , 
y  y  n  y  n  y 


O  "'1 

where  the  inver’se  function  F  is  defined  in  (1.1) 

m 


For  specified  a,  called 


i..*.  -V.-.A.I*.  A  ^  .  r_  .  ft  ■  A  M  ^ 


the  overall  error  rate,  we  shall  show  in  the  next  section  how  to  pick  the  two 


quantile-intervals  so  that; 

(a)  The  level  a  test  for  :  A  =  0  versus  H,:  A  *  0 

0  A 

rejects  Hq  whenever  the  quantile-intervals  are  disjoint. 

(b)  The  (1-a)  confidence  interval  for  A  is 


(2.3) 


The  choices  of  and  Y^  (or  equivalently  of  d^  and  d^)  which  are  given 
in  the  next  section,  are  based  on  the  normal  approximations  of  the  endpoints 
of  the  quantile- intervals.  We  shall  now  introduce  the  background  of  such 
approximations  which  are  based  on  i.i.d.  representations  of  the  PLE  F°  and  its 
quantile  function  ^(O  due  to  Lo  and  Singh  (1986). 

Let  F^(t)  =  P(X^  <  t,  =  1)  and  G^(t)  =  P(Y^  <  t,  6^  =  1)  be  the 
subdistribution  functions  for  the  uncensored  observations.  For  positive  reals 
X,  t  and  for  t  taking  values  zero  or  one,  let 


I(x<t,e  =  1  )  _  jxAt  ^ 

F(x)  °  [F(s)]^ 

Similarly,  one  can  define  n(y,  6,  t)  using  instead  G°,  G  and  G^  in  (2.4).  The 
following  proposition  is  an  extension  of  Theorem  1  of  Lo  and  Singh  (1986).  A 
proof  can  be  found  in  Lo,  Mack  and  Wang  (1986). 


(2.4) 


C(x,e,t)  =  F°(t)  I 


Proposition  1 .  For  any  T  such  that  F(T)  <  1 ,  we  have 


1  ^ 

-5  f  =!•  *  "«"■> 

where  sup  lR_(t)|  =  0()!,n  m/m)  a.s.  and  S,n  denotes  the  natural  logarithm. 
0<t<T 


Remarks .  1.  The  E,(X^,  t)'s  are  i.i.d.  with  mean  0  and  variance 


[F°(t)]2  jl 


dF^ (s) 
[F(s)]^ 


1  /2  - 

2.  The  process  m  converges  weakly  to  a  mean  zero  Gaussian 

process  Z(t)  which  has  continuous  paths  with  probability  one  on  D[0,T]. 

The  next  proposition  is  a  slight  variant  of  the  quantile  representation 

in  Theorem  2  of  Lo  and  Singh  (1986)  which  is  an  extensioi.,  in  the  presence  of 

censoring,  of  Bahadur's  representation  for  the  sample  quantiles  (Serfling 

1980,  p.  91).  The  current  variant  is  similar  to  Ghosh's  extension  of 

Banadur's  representation  (Serfling  1980,  p.  92)  as  we  are  concerned 
1  /2 

wirn  n  -  asymptotics,  and  the  proof  can  be  obtained  along  the  same  lines. 


Proposition  2.  For  any  0  <  p  <  1,  if  the  derivative  f°(t)  o£  F°(t)  is 
continuous  and  positive  at  t  =  F°  \p),  we  have 


~  1  /  „0  -  1  ,  . 
r  i.  pj  -  r  (p)  = 
m 


1  ;  ''‘p”  ,  -1/2, 

-  2 - z -  +  o„(m  ) 

""  i  =  l  f°(F°  “^p)) 


Ijf''  \p))/f°(F°  ~\p)) 


,  -1/2, 

Op(m  ) 
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■  .S-  .  A-^  ^  T  t  A--. 


'•/ "S  * 

\  .  JlV  .V-  K*. 


■I  -  A ..  f.  -  n 


■  Xm  t  .A,  J  -  W-V-W.W. 


Similar  results  can  be  obtained  for  the  y-sample  and  we  will  denote  tne 


mean  process  by  n  (t). 

n 

Section  3-  Asymptotic  Behavior  of  Two-sample  Tests  and  Confidence  Intervals. 

Consider  the  two-sample  location  model  in  Section  2.  In  this  section  we 
shall  assume  that  the  distribution  L  is  continuously  differentiable  near  zero 
with  density  2,(0)  >  0.  The  asymptotic  results  in  section  2  of  Het tmansperger 
(1984a)  will  be  extended  to  the  censored  data  case. 


Theorem  1 ■  Let  be  a  positive  constant  and 

1  /2 

d  =  m/2  1  0.5  -  Z  m  /2  .  (3.1) 

XX 

Then 

^x  "  -m  “  ^x  ■  ^  . 

(3.2) 

U  =  "'((m  -  d  *  1)/m)  =6  ^  Z  /[2m’ ^^!i(0)  ]  -  ^  (9  )/i(0)  +  o„(m"’^^). 

xm  X  XX  mx  P 

Proof.  3y  proposition  2, 

L.  =  F°  '  (d  /m)  -  ^  ’(d  /m))/!l(F°  '  (d  /m)  -  9  )  +  o_(m 

X  X  m  X  x  X  P 

=  9  -  :i/(2m)  -  Z  /(2m’ ''^)]/i(0)  -  l\  ( F°  ”’ (d  /m)  )  -  ^  (9  )]/2,(0) 

X  X  m  X  m  X 

-  ^  '9  ;/2.(0)  1  :i/2,(0;  -l/2,(F'’(d  /m)  -  9  )]  ^  ( F°  "’(d  /m))  +  o„(m"’ 

mx  oxx  mxP 

-  ,  1  /2  - 

r'lnce  ^  ^  mean  3  for  all  t  and  the  process  (m  ^^(t),  OStST}  converges 

weaxly  a  con-in'ious  path  Caussian  process,  it  follows  that 
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•f. 


Y 

■u* 

> 

I  • 


tv 

f. 

f. 


Also  £  (F'^  \cl  /m))  =  o„(m  and  !!,(•)  is  continuous  at  zero  implies  that 

m  X  P 

-1/2 

the  second  to  last  term  is  also  OoCm  ).  We  have  thus  proved  in  (3.2) 
and  Ly  is  similar.  )_ 


-1  /2 

Remark.  The  remainder  Op(m  )  in  (3.2)  and  (3.3)  can  be  replaced  by 

-1/2  3/4 

o(m  )  a.s.  with  a  modified  proof  or  an  even  better  rate  (say  0(()ln  m/m)  ) 

a.s.)  if  we  further  assume  that  the  second  derivative  of  L  exists.  While  all 

the  asymptotic  resuld.s  on  convergence  in  probability  in  this  section  can  be 

generalized  to  convergence  a.s.,  we  shall  not  attempt  to  do  so  as  the  current 

forms  suffice  for  our  purpose. 

Statements  similar  to  Theorem  1  apply  to  Ly  and  Uy  as  well  with 
corresponding  Zy  and  dy.  Let 


r®x  -  -2 

"x  =  Jo 


(3.3) 


then  Var(;  (9  ))  =  t  /(Am)  .  The  central  limit  theorem  applied  to  £  (9  )  then 
m  X  X  ^m  X 

gives : 


Coro’lau’y.  As  m  + 


i  1 


1  /2 

m  (L^  -  0^)  converges  weakly  to  a  Normal  distribution  with 
mean  -  Z,^/[2  !,(0)]  and  variance  11(0)]^. 


1  /2 

li)  m  (U.,  -  L,.)  converges  in  probability  to  Z../2,(0)  . 

X  X  ’  "  ■  "  X 


( iii ) 


f 


f 


k 

I 

f 


P(0..  <  L.J  converges  to  ♦(-Z.yCT.J  ),  where  $  is  the  standard 

X  X  "  ~  X  X  ~ '  ”  It-- 

normal  distribution  function. 


The  approximate  confidence  coefficient  for  [L^^,  Uj^]  is  thus 


''x  -  '  - 


rt  should  be  noted  that  =  1  in  the  uncensored  case. 

The  statements  in  the  Corollary  also  applies  to  L  and  U  with 
0  j  y 

[G{s)]“^  dG^(s). 


'1 


Let  N  =  n  ^  m,  and  assume  that  m/N  tends  to  X,  0<X<1  ,  as  m  and  n  tend  to 

infinity.  The  following  theorem  relates  a,  the  overall  error  rate,  to  and 

Z.,  which  determine  Y  and  Y  . 

/  X  y 


Theorem  2.  P(U,.  -  L..  <  A)  +  P(L..  -  U..  >  A)  -*•  2$(- 
y  X  y  X 


(1-X)^/2  Z  .  x^/^  z 
_ X _ y- 

[(l-X)  +  X 


Proof,  By  tne  Corollary,  N^'^^(U  -  0  )  -  -  9  )  converges  weakly  to  a 

y  y  XX 

1  /p  1 /p 

nor^ial  distribution  with  mean  Z  /[2Jl(0)(1-\)  ]  +  Z  /[2Z(0)  k  ]  and 

y  ^ 

1  /p  p  1 /p  p 

variance  x.  /[2£(0)  ( i -A )  ]  x  /L2£(0)A  ]  .  Hence 

y  ^ 


P(U  -  L  <  A) 

y  X 

1/2  1/2  ^  ^  Z 

=  P(N  (U  -  9  )  -  N  (L  -  0  )  <  0)  i>[ - ^ - TTV^] 

^  ^  [XT.  (l-X)T  J^/2 

y  X 

limiiariy,  P'^L,  -  U,  >  A)  converges  to  the  same  limit. 

y  X 

Cefine  Z  by  a  =  21)(-Z  ),  and  Let  Z^,  Z^  satisfy 
"1  ot  X  y 


(3.5) 


( 1-A) 


Z  = 

y 


Z  [{1-X) 
a 


Then  for  depths  d^,  d„  given  by  (3.1),  we  have 

•  j 


P(L  -  U  <  A  <  U  -  L  )  -►  1-a  , 
y  X  y  X 

and  under  A  =  0,  P(L  >U)+P(L  >U)-»'a.  Hence  [L  -  U,U  -  L] 

0  xy  yx  yx  yx 

is  a  confidence  interval  for  A  with  approximate  confidence  coefficient  (1-a), 
and  the  test  which  rejects  Hq  whenever  0  is  not  in  the  confidence  interval,  or 
equivalently  whenever  the  two  quantile-intervals  are  disjoint,  has  approximate 
size  a. 

The  condition  (3.5)  provides  an  infinite  number  of  choices  for  Z.,  and  Z„. 

^  y 

The  following  Theorem  shows  that  the  asymptotic  length  of  any  of  the  two- 
sample  confidence  intervals  does  not  depend  on  Z„  and  , 

Theorem  3.  Let  A  denote  the  length  of  the  two-sample  confidence  interval. 

Then  A  ■*  Z_[(1-A)t  +  [X  ( 1 -A  )  ]  ^ '^^2,  ( 0 ) }  in  probability. 

^  X  y  '  “ 

Proof.  A  =  N^^^(U  -  L  )  -^  n'^^(U  -  L  ) 

-  y  y  XX 

-  Z  /C(1-X)^^^2,(0)]  -  Z  /Cx’''^2,(0)]  , 

y  ^ 

by  Corollary  (ii).  The  theorem  now  follows  from  (3.5).  | _ [ 

We  now  consider  three  ways  to  specify  Z  and  Z  . 

X  y 
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ZvIvOvivl-. 


UV  lA.  Li?- 


i-kJW. 


Quant  lie- intervalg  with  equal  confidence  coef f loients : 

If  we  choose  Y  =  Y  ,  then  Z  =  Z  .  The  condition  (3.5)  then 

xy  XX  yy 

implies 


1/2  ^  \  1/2  ,  ,1/2 

,  'x  ,  'x  *"’x  *  ”V> 

...  ,1/2  ^  r-.  ,1/2  “  .  J/2  ,  A/2  *  ’ 

[(1-X)t  ]  +  LXt  ]  (ni  )  +  (mT  ) 

X  y  X  y 


1/2  ,  ,1/2 

Z  =  (t  /t  )  Z  =  - ^—T75  •  2 

^  y  "  (nt  ^  ,1/2  u 

X  y 


Quantile-intervals  with  equal  asymptotic  lengths: 

-1/2  -1/2 

In  this  case,  m  =  n  ^y  '  Combining  this  with  condition 


(3.5) ,  we  have 


-X,  X  1^/2  ,  ,1/2 

L(1-\)t  *  XT  ]  (nT  +  mt  ) 

^  [m(l-A)]^/2  ^  (^^^1/2  ^^1/2 


Z  =  (n/m)^  Z 
y  X 


(nT  +  mT  ) 
X  _ y 


Quantile  intervals  with  equal  depths; 

In  this  case  d  =  d  ,  and  (3.1)  implies  m  -  n  =  m^^^  Z„  -  n^'^^  Z„. 

Ay  X  y 

Condition  (3.5)  now  gives 


r  X  ,  ,  ,  ,  i1  /'2  1  /2  ,1/2,  , 

[(1-X)t  +Xt]  n  Z  +X  (m-n) 
_ X _ y _ ot _ 

r  ^  7l/2  ,  .  .1 /2 

[n ( 1 -X  )  ]  +  LmX  ] 


*•  .N  /•  w*.  A  .  •  »  -  A  A  A  V  A  A  A  .v  w'v  u%  A  A  A  A  .  •  A  A  A  .VA,  A  .VA  AAAV 


k? 


y. 


V 


K 


A 


I 


K 


If.. 


\ 


■.V 


V 


fc; 


,  J/2  1/2  ,  1/2.  , 

(nx  +  mx  )  n  Z  +  m  (m-n) 
X _ y _ Qt _ 


n  +  m 


,  ,1/21/2.^  ,  , 

,  Z  "  (m-n)n 

„  ,  ,  J /2  „  ,  ^  -1/2  X  y  a 

Z  =  (m/n)  Z  -  (m-n)n  =  - 

y  X  n+m 


1  /2 


(3.8) 


Remarks.  1.  When  the  sample  sizes  are  equal,  (3.8)  reduce  to  (3.7)  and  in 


J  /2 


both  cases  Z  =Z  =Z  (x  +x)  /2.  However,  unlike  the  uncensored  case 

X  y  a  X  y 


(where  x  =x  =1),(3.6)  does  not  reduce  to  (3.7). 
X  y 


2. 


1/2 


Under  x  =  x  ,  the  Z„  in  (3-6)  and  (3-7)  equals  x  times  the  Z„  for  the 
X  y  ^  X  ^ 


1/2 


corresponding  uncensored  case,  but  in  (3. 4)  the  x^  cancels  and  the 


confidence  coefficients  Y  and  Y  remain  the  same  as  the  uncensored 

X  y 


3ase.  Neither  Z^,  Z,,  nor  Y  ,  Y  produced  by  (3.8)  remains  the  same  as 
•'  y  X  y 


the  uncensored  case.  If  in  addition  m  =  n,  all  three  solutions  reduce 


1  /2 


to  Z  =  Z  =  Z  X  /2.  Note  that  if  the  censoring  schemes  are  different 
X  y  a  X  ® 


for  tne  two  samples,  in  general  x  *  x  even  under  A  =  0. 

X  y  u 


3. 


It  should  be  noted  here,  as  implied  by  Theorem  1  and  (3.5),  that  the 
implementation  of  the  confidence  interval  and  test  for  A  rely  only 


on  x^  and  x^  and  not  on  the  density  at  the  median.  Hence  our  procedures 


avoid  estimating  the  density  at  the  median  which  is  required  by 
Brookmeyer  and  Crowley  (I982ab)  and  Wei  and  Gail  (1983).  All  procedures 


essentially  require  the  consistent  estimation  of  x^  and  x^ .  In  practice 


this  can  be  done  in  several  ways,  for  example, 
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(3.9 


^  md . 

■  x.L  N(x^)[»(x‘)  .  a.: 

1  X 


where  m  =  F  (1/2)  is  the  x-sample  median,  d.  is  the  number  of 

m 

observed  deaths  at  and  N(X. )  =  I  I(X.  >  X. )  is  the  number  of 

observed  survival  times  larger  than  X. .  The  estimator  t  is  similar  to 

1  X 

formula  (2)  of  Brookmeyer  &  Crowley  (1982a).  See  also  (5.2)  of 
Brookmeyer  and  Crowley  (1982b)  for  an  alternative  estimate. 

4.  Mood's  median  test  was  shown  to  be  a  special  case  produced  by  sign- 
intervals  in  Hettmansperger  (1984a).  With  censoring,  the  median  test 
statistics  can  no  longer  be  expressed  in  terms  of  the  differences  of 
order  statistics  as  in  formula  (6)  there.  It  is  not  clear  whether  the 
median  test  by  either  Prentice  (1978)  or  Brookmeyer  and  Crowley  (1982b) 
can  be  produced  by  our  quantile-intervals. 


Section  4.  Tables  for  confidence  coefficients  and  a  recommendation 

Confidence  coefficients  and  corresponding  to  a  two-sample  test  of 
A  =  0  versus  A  *  0  at  level  a  =  .05  will  be  provided  in  this  section 
for  the  three  intervals  described  in  the  previous  section.  Due  to  censoring. 


the  confidence  coefficients  Y  and  Y  depend  on  the  value  of  x  and  x  so  we 

X  y  X  y 

shall  only  consider  a  special  case,  the  proportional  hazards  model,  where  x^ 
and  X  can  be  evaluated  easily.  Let  H(t)  =  [F°(t)]'",  K(t)  =  [G°(t)]^,  then 

y 

r  + 1 

T  =  (2  -  1)/(r  +  1)  and  -rr  =  probability  of  x-censoring  =  r/(r  +  1). 

X  X 

<5  +  1 

Similarly,  x^  =  (2"  -  1)/(3  +  1)  and  =  probability  of  y-censoring  = 

s/(s  1  )  .  We  shall  consider  r,  s  =  1/3,  1,  3,  which  correspond  to  25^,  50? 
and  75?  censoring. 
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•‘7jL 

iWn 


(Tables 1-3  here) 


Note  that  t  =  t  when  r  =  s,  then  (3-6)  and  (3.7)  give  the  same  Y  and  Y  as 
X  y  X  y 

in  the  uncensored  case.  Hence,  the  first  two  columns  which  corresponds  to  r 
s  in  Tables  1  and  2  should  be  the  same  as  those  columns  corresponding  to 
equal-coefficient  and  equal  -  length  case  in  Table  1  of  Hettmansperger 
(1984a).  The  discrepancy  is  due  to  computational  accuracy.  In  the  equal 
depth  case  (Table  3).  (3.8)  depends  not  only  on  the  ratio  of  sample  sizes  but 
also  the  values  of  m.  We  chose  m  =  90  so  that  the  corresponding  n  is  an 
integer.  Our  computation  shows  that  the  pair  of  coefficients  are  very  high 
(over  .95  for  m/n  ^  1.5  and  close  to  1  for  m/n  ^  2)  for  unequal  sample 
sizes.  Therefore  we  only  provide  the  results  for  ratio  of  sample  sizes  (m/n) 
that  are  1  and  1.5  in  Table  3. 

As  can  be  seen  from  the  tables,  the  confidence  coefficients  in  Table  1 
appear  to  be  quite  stable  (between  .83  and  .88)  for  various  ratio  of  sample 
sizes  and  degrees  of  censoring  compared  to  those  in  Tables  2  and  3.  For  this 
reason  the  equal  coefficients  solution  (3.6)  seems  preferable  to  the  others 


and  we  recommend  its  use  in  applications.  In  an  unpublished  paper  of  Tableman 

and  Hettmansperger  (1988)  the  equal  lengths  case  may  be  preferable  in  some 

uncensored  cases  based  on  Bahadur  efficiency.  In  the  censored  case,  the  equal 

lengths  solution  (3.7)  may  be  preferable  to  the  equal  coefficients  solution 

(3.6)  since  one  only  needs  to  estimate  t  +  t  when  m  =  n  or 

X  y 

(1-A)t  +  At  in  general,  while  in  (3.6)  one  has  to  estimate,  in  addition, 

X  y 

and  Ty  separately.  The  choices  of  and  Zy  (or  equivalently  Y^  and  Y^ ) 
for  (3.8)  are  not  recommended  since  they  are  more  complicated  than  (3.7)  even 
when  =  Ty  and  they  require  much  higher  coefficients  (see  Table  3)  for 
unbalanced  sample  sizes  compared  to  those  coefficients  provided  in  Tables  1 
and  2 . 


Section  5.  An  Example 


We  shall  apply  the  two-sample  procedure  in  section  3  to  data  from  a  Phase 
III  colorectal  cancer  clinical  trial  (Ansfleld  and  Klotz  1977).  The  data 
analyzed  were  an  updated  version  reported  in  the  Central  Oncology  Group  Final 
Report  (COG  7030),  Spring  1977.  These  data  were  also  analyzed  by  Brookmeyer 
and  Crowley  (1982a)  to  compare  confidence  intervals  for  the  median  survival 
times  of  four  dosage  regimens  5-fluorouracil .  Refer  to  Brookmeyer  and  Crowley 
(1982a)  for  a  description  of  the  data.  A  summarization  of  the  data  is  given 
in  Table  k  together  with  95?  confidence  intervals.  The  median  survival  time 

(weeks)  and  the  quantile-intervals  are  computed  by  a  continuous  version  of  the 

* 

Kaplan-Meier  estimator,  denoted  by  F^,  discussed  in  Remark  1  of  section  6  and 
is  therefore  slightly  different  from  the  median  in  Table  7  of  their  paper. 


(Table  4  here) 


For  y  =  ,95  the  in  (3*4)  is  equal  to  1.96  times  (t  )  where  t  is 
estimated  by  t  in  (3.9).  The  depth  d^  in  Table  4  is  then  obtained  from 

(3,1)  which  for  treatment  1  is  19.09.  The  quantile-interval  [L^,  U^)  is 

*  *  -1  ♦ 
obtained  by  L  =  F  (d  /m)  and  U  =  F  ((m  -  d  +  1)/m),  where  F  is  the 
xmx  xm  X  m 

linear  interpolated  Kaplan-Meier  estimate  as  discussed  in  Remark  1  of  section 

6.  A  table  for  the  survival  estimate  S°  =  1  -  F°  is  available  for  treatment  1 

m 

in  Table  3  of  Brookmeyer  and  Crowley  (1982a).  Our  estimate  F°  (which  is  not 

m 

included  here)  coincides  with  theirs. 

To  make  pairwise  comparisons  at  the  5  per  cent  significance  level  we  need 

♦■he  confidence  coefficients  Y  and  Y  and  the  depth  values  d„  and  d„.  Table  5 

X  y  X  y 

gives  the  results  for  the  three  different  methods  to  soecify  and  Z,,  (cf. 

X  y 

.'ur:.'.uia  'Z.6y  to  (3.3;;. 


liable  5  here) 


It  turned  out  that  none  of  the  one-sample  quantile  intervals  in  Table  5 
are  disjoint  hence  all  three  methods  fail  to  detect  a  significant  difference 
within  any  of  the  six  pairs.  Brookmeyer  and  Crowley  (1982b,  section  6) 
applied  the  k-sample  (k=i))  median  test  to  this  data  set  and  also  founa  no 
significant  differences  among  the  four  treatments.  Our  conclusion  is 
therefore  in  agreement  with  theirs.  It  is  worth  noting  here  that  the 
quantile-intervals  with  equal  asymptotic  lengths  actually  yield  intervals  with 
quite  uifferent  lengths.  This  suggests  that  higher  order  accuracy  is  needed 
for  the  asymptotic  length  than  that  given  in  Corollary  (ii)  of  section  3-  In 
our  current  example  the  sample  sizes  (and  t^)  for  each  treatment  are  fairly 
even,  hence  the  quantile-intervals  produced  by  all  three  methods  ((3.6)  to 
(3.8))  are  comparable. 

Section  6.  Discussion  and  generalization 

1.  The  quantile-intervals  that  are  defined  in  (2.1)  and  (2.2)  are  based 

on  the  PLEs  F°  and  G°  which  are  step  functions.  Since  the  distribution 
m  n 

0  o 

functions  F  and  G  are  assumed  to  be  continuous  it  may  be  preferable  to  use  a 

* 

continuous  version  (for  example,  linear  interpolation)  of  the  PLE.  Let  F  and 

m 

* 

G^  denote  such  a  version  by  connecting  the  jump  points  (c.f.  Figure  1  of 
Brookmeyer  and  Crowley  (1982b)).  It  follows  from  the  appendix  of  Gastwirth 
and  Wang  (1988)  that 

Sup  |F*(t)  -  F°(t)|  =  0(m"S  a.s.  (6.1) 

o^t<T  ^ 

and 
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,  V.  v; 


.  V.  . 


for  any  o<p^<1 .  In  practice,  one  may  use  the  smoothed  quantile-interval 

*  *  o  * 

[L  ,  U  ]  by  replacing  F  by  F  in  (2.1).  It  is  immediate  from  (6.2)  that  all 
X  X  ^  m  m 

the  asymptotic  results  in  section  3  apply  to  the  smoothed  quantile- intervals 


2.  Theorem  3  shows  that  all  the  intervals  have  the  same  asymptotic 

1  /2 

length  and  N  A  /(2  Z^)  converges  in  probability  to 

[(1-X)t  +  Xt  '^^/[4X ( 1 -X ) ii^(O) which  is  the  reciprocal  of  the  Pitman 

X  y 

efficacy  (c.f.  Hettmansperger  (1984b))  of  the  control  median  test  by  Gastwirth 
and  Wang  (1988).  It  can  be  checked  easily  that  both  median  tests  by 
Brookmeyer  and  Crowley  (1982b)  and  Prentice  (1987)  have  the  same  efficacy  as 
the  control  median  test.  Hence  all  four  median  type  tests  have  the  same 
efficiency . 

3.  The  two-sample  inference  method  based  on  quantile-intervals  was 
developed  only  under  the  location  model.  For  the  Behrens-Fisher  problem  where 
we  test  whether  both  populations  have  the  same  median  but  may  differ  in  shape, 
the  significance  level 


[XT  [f°(e  )]2  +  (i-x)t  [g°(9  )]^}^^^ 
y  A  X  y 


Then  one  must  estimate,  in  addition  to  t  and  t  ,  the  ratio  f°(9  )/g°(0  ), 
where  f°  and  g°  are  the  densities  of  F°  and  G°,  respectively.  The  estimation 
of  f°(9^)/g°(9^)  can  be  accomplished  by  density  estimation  procedures  in  the 
censored  case  (Lo,  Mack  and  Wang  (1986),  Padgett  and  McNichols  (1984))  which 
usually  require  very  large  sample  sizes.  Hence,  the  procedure  in  this  paper 


can  be  used  for  the  Behrens-Fisher  problem  as  in  Hettmansperger  (1973). 


4.  We  demonstrated  in  this  paper  how  to  conduct  two-sample  inference 


methods  based  on  one-sample  procedures.  Although  this  is  done  for  the 
quantile-intervals  (or  sign-tests)  under  the  location  model  it  can  be  applied 
to  other  nonparametric  procedures  (e.g.  linear  rank  tests)  and  models  (e.g, 
scale  model  or  stochastically  ordered  alternative).  The  crucial  step  lies  in 
the  asymptotic  distribution  of  the  endpoints  of  the  one-sample  confidence 
intervals  (cf.  Corollary  in  section  2). 
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Table  1.  Equal  Confidence  Coefficients  to  Produce  a  Size  .05 
Two-sided  Two- Sample  Test  (Y  =  Y  ) 


f 


(ir  ,iT  ) 
x’  y 

.25. .5)  (.25, .75)  (.5, .25)  (.5, .75)  (.75, .25)  (.75, .5) 


Table  2.  Confidence  Coefficients  Corresponding  to  Equal  Lengths 


(.25, .5)  (.25, .75)  (.5, .25)  (.5, .75)  (.75, .25)  (.75, .5] 


834 

.834 

.864 

.806 

VJI 

00 

;737 

.806 

.864 

.933 

.754 

.737 

.958 

.754 

.933 

879 

.794 

.909 

.771 

.983 

.717 

.848 

.821 

.967 

.930 

.763 

.920 

.785 

.890 

91  0 

.770 

.938 

.750 

.993 

.707 

.880 

.793 

.984 

.717 

.786 

.889 

.81  1 

.858 

933 

.754 

.958 

.737 

.997 

.701 

.905 

.774 

.992 

.709 

.806 

.864 

.834 

.834 

950 

.742 

.971 

.727 

.999 

.696 

.92  4 

.76 

.996 

.703 

.826 

.844 

.854 

.815 

n  «.*  K.”  ’ 


Table  3- 

Confidence 

Coefficients 

with  Equal 

Depth  (d^ 

IT  ,  IT  ) 

m=n 

=90 

m=90,  n=60 

X  y 

7 

7 

7 

7 

X 

y 

X 

y 

( .25, .25) 

.834 

.834 

.997 

.997 

(.25, .5  ) 

.864 

.806 

.998 

.993 

(.25, .75) 

.958 

.737 

.999 

.963 

( .5  , .25) 

.806 

.864 

.993 

.998 

(.5  ,.5  ) 

.834 

.834 

.995 

.995 

(.5  ,.75) 

.933 

.754 

.999 

.966 

(  .75, .25) 

.737 

.958 

.946 

.999 

(  .75,  .5  ) 

.754 

.933 

.951 

.998 

(  .75,  .75) 

.834 

.834 

.973 

.979 

Table  4.  Summary  Statistics  and  95%  Confidence  Intervals  for  Four  Treatments 


Treatment _ 1 _ 2 _ 3 _  4 


Sample  size 

53 

56 

58 

52 

Censored  proportion 

16/53 

8/56 

1  4/58 

7/52 

Median  survival  time  (weeks) 

59.18 

39.53 

43.23 

28.52 

Estimated  t 

X 

1  .109 

1  .1  52 

1  .23 

.90 

Depth  dj^ 

19.09 

20.05 

20.33 

20.15 

95J  quantile-interval  [L^,  U^) 

[37.4,72.5) 

[28.6,52.3) 

[25.9,61 .4) 

[24,3,45.0) 

95%  confidence  interval 

by  Brookmeyer  and  Crowley 

[38,73) 

[31 ,51 ) 

[28,60) 

[25,46) 

Table  5.  Two-Sample  Procedures  for  all  Six  Pairwise  Comparisons,  a  =  0.05 


Treatment  1  &  2 


a  d 
X  y 

[L  ,  U  ) 

X  X 

[L  ,  U  ) 

y  y 

Treatment  1  &  3 

Y  Y 
X  y 

d  d 
X  y 

ll  ,  u  ) 

X  X 
[L  ,  U  ) 

y  y 

Treatment  1  &  U 

Y  Y 
:<  j 

d  d 
X  y 

‘•■"x’  '■'x^ 

i-,.  u^) 

Treatment  2  '!e  3 

Y  Y 

X  y 

d  d 
X  y 

--X- 

Treatment  2  4  J 


'  ,  -J  ) 

X  X 
'L  ,  U  ) 

y  y 

Treatment  3  &  ^ 

Y  Y 
X  y 


Equal  coeffioients 


21.69  22.93 

[39.6,  6M.7) 
[34.8,  48.2) 


21.69  23.65 

[39.6,  64.7) 
[35.5,  51 .6) 


21.68  21.76 

[39.6,  64.7) 
[24.7,  42.0) 


.834  .834 

23.65  22.93 

[34.8,  48.2) 
[35.5,  51.6) 


21.76  22.93 

[34.8,  48.3) 
[24.7,  42.0) 


23.6'4  21.76 

[35.5,  51 .6) 


Equal  lengths 


21.71  22.91 

[39.6,  64.6) 
[34.8,  48.3) 


21.67  23.67 

[39.6,  64.8) 
[35.5,  51.6) 


21.92  21.52 

[39.8,  64.0) 
[24.6,  42.4) 


.831  .837 

23.69  22.89 

[34.8,  48.3) 
[35.5,  51.5) 


21.54  23.16 

[35.0,  47.8) 
[24.6,  42.4) 


i’3.93  21.50 

[35.7,  51.1) 
[24.6,  42.5) 


Equal  depths 


22.29  22.29 

[40.3,  63.3) 
[34.2,  49.4) 


22.62  22.62 
[41,1,  62.6) 
[32.0,  58.2) 


21.72  21.72 

[39.6,  64.6) 
[24.7,  42.1) 


23.29  23.29 
[35.1,  47.6) 
[35. 1  ,  52.2) 


22.32  22.32 

[34.2,  49.3) 

[24.8,  40.3) 


22,65  22.65 

[32.2,  58.2) 
[24.9,  38.9) 
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Kaplan-Meier  product  limit  estimator  and  have  the  same  Pitman  efficiency  as 
the  intervals  found  by  inverting  the  sign  test.  Two-sample  tests  and  con¬ 
fidence  intervals  for  the  difference  in  median  survival  times  are  then 
developed  based  on  the  comparison  of  the  one-sample  confidence  intervals. 
Several  methods  for  choosing  the  confidence  coefficients  of  the  correspond- 
ing  one-sample  confidence  intervals  are  developed.  The  Pitman  efficiencies 
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of  these  two-sample  tests  are  the  same  as  that  of  the  median  test  proposed 
by  Brookmeyer  and  Crowley  (1982b).  The  procedures  can  also  be  used  for  the 
Behrens-Fisher  problem. 

Nonparametric  two-sample  inference  procedures  are  useful  in  comparing 
the  responses  of  treatment  and  control  groups.  In  medical  follow  up 
studies  the  data  are  usually  subjected  to  censoring.  While  many  of  the 
two-sample  procedures  have  been  extended  to  accommodate  censored  data  we 
show,  in  this  paper,  how  to  construct  two-sample  tests  and  confidence 
intervals  based  on  one-sample  confidence  intervals.  This  method  was  first 
discussed  in  Hettmansperger  (1984a)  for  uncensored  data  where  a  confidence 
interval  for  the  difference  in  population  medians  is  constructed  by  sub¬ 
tracting  the  endpoints  of  one  (one-sample)  interval  from  the  opposite 
endpoints  of  the  other.  The  test  then  rejects  the  null  hypothesis  of 
equal  medians  if  the  one-sample  intervals  are  disjoint.  Hettmansperger 
used  the  sign-interval  as  the  one-sample  interval  which  is  obtained  by 
inverting  the  sign-test.  In  the  presence  of  censoring  we  modify  the  sign- 
interval  to  the  so  called  quantile-interval  whose  endpoints  are  the 
quantiles  of  the  product-limit  estimator  of  Kaplan  and  Meier  (1958). 

After  deriving  the  asymptotic  properties  of  the  quantile- interval  we 
show,  for  specified  overall  level  a,  three  ways  to  select  the  confidence 
coefficients  for  the  one-sample  quantile  intervals.  For  a  =  0.05  and 
equal  confidence  coefficients  for  both  samples,  the  one-sample  confidence 
coefficients  are  in  the  neighborhood  of  .85  under  the  proportional  hazard 
model.  The  procedures  are  then  applied  to  data  from  a  colorectal  cancer 
clinical  trial  to  compare  four  treatments. 


